A New Eecient Radix Sort
نویسندگان
چکیده
We present new improved algorithms for the sorting problem. The algorithms are not only eecient but also clear and simple. First, we introduce Forward Radix Sort which combines the advantages of traditional left-to-right and right-to-left radix sort in a simple manner. We argue that this algorithm will work very well in practice. Adding a preprocessing step, we obtain an algorithm with attractive theoretical properties. For example, n binary strings can be sorted in n log B n log n + 2 time, where B is the minimum number of bits that have to be inspected to distinguish the strings. This is an improvement over the previously best known result by Paige and Tarjan. The complexity may also be expressed in terms of H, the entropy of the input: n strings from a stationary er-godic process can be sorted in ? n log ? 1 H + 1 time, an improvement over the result recently presented by Chen and Reif. 1 Background A common idea in the design of eecient algorithms is to take the distribution of the input into account. Well known examples are interpolation search 9, 21, 22], trie structures 8, 10, 13], and bucketing algorithms 6, 7]. When describing the complexity of such algorithms, it is common to rely on the assumption that the input elements are independently and randomly drawn from a speciic distribution, for instance the uniform or normal distribution. A more general approach is to assume no particular distribution, but to express the complexity of the algorithm in terms of certain properties of the input. In this article, we study this more general approach in the context of sorting. Speciically, we study the problem of arranging binary strings in lexicographic order. A new approach to this problem was recently presented by Chen and Reif 4]. They introduced a randomized sorting algorithm for binary strings from a stationary ergodic process (see Section 3.2) with en-tropy H, yielding an expected cost n log log n H + 2 : This was claimed to be the rst algorithm to adapt to an unknown input distribution. We take a more general approach based on distinguishing preexes. Let B denote the total length of all distinguishing preexes, i.e. the minimum number of bits that have to be examined in order to sort the strings. Let B = B=n. For a stationary ergodic process satisfying a certain mixing condition (see Section …
منابع مشابه
The Effect of Local Sort on Parallel Sorting Algorithms
We show the importance of sequential sorting in the context of in memory parallel sorting of large data sets of 64 bit keys. First, we analyze several sequential strategies like Straight Insertion, Quick sort, Radix sort and CC-Radix sort. As a consequence of the analysis, we propose a new algorithm that we call Sequential Counting Split Radix sort, SCS-Radix sort. SCS-Radix sort is a combinati...
متن کاملModified Pure Radix Sort for Large Heterogeneous Data Set
We have proposed a Modified Pure Radix Sort for Large Heterogeneous Data Set. In this research paper we discuss the problems of radix sort, brief study of previous works of radix sort & present new modified pure radix sort algorithm for large heterogeneous data set. We try to optimize all related problems of radix sort through this algorithm. This algorithm works on the Technology of Distribute...
متن کاملConscious Radix Sort
The exploitation of data locality in parallel computers is paramount to reduce the memory traac and communication among processing nodes. We focus on the exploitation of locality by Parallel Radix sort. The original Parallel Radix sort has several communication steps in which one sorting key may have to visit several processing nodes. In response to this, we propose a reorganization of Radix so...
متن کامل‘Review of Radix Sort & Proposed Modified Radix Sort for Heterogeneous Data Set in Distributed Computing Environment’
We have proposed a Modified Pure Radix Sort for Large Heterogeneous Data Set. In this research paper we discuss the problems of radix sort, brief study of previous works of radix sort & present new modified pure radix sort algorithm for large heterogeneous data set. We try to optimize all related problems of radix sort through this algorithm. This algorithm works on the Technology of Distribute...
متن کاملPartitioned Parallel Radix Sort
Load balanced parallel radix sort solved the load imbalance problem present in parallel radix sort. By redistributing the keys in each round of radix, each processor has exactly the same number of keys, thereby reducing the overall sorting time. Load balanced radix sort is currently known as the fastest internal sorting method for distributed-memory multiprocessors. However, as the computation ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1994